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1. Introduction 

Let / be some function from the positive integers {0, 1,2,...} into (0, oo), and let 
G be a graph and xq some vertex of it. Then we define reinforced random walk 
on G starting at xo with reinforcement function / as follows. It is a walk on G 
i.e. a random sequence of vertices xq, x\, . . . with Xj+i a neighbour of xf, and the 
transition probabilities depend only on the past. Specifically, for an edge e and 
time t we define the number of traversals N(e, t) using 



where (x s _i , x s ) is an unordered edge, so N counts traversals of e in both directions. 
Now apply / and normalize to get probabilities. Totally, the definition is 



where ~ is the neighbourhood relation in our graph. This finishes the definition 
of the process. This is well defined assuming all degrees of all vertices are finite, 
and we assume this from now on on all our graphs. There are also versions where 
reinforcement is applied to vertices or to directed edges, but for simplicity we 
concentrate for now on edge reinforced walks. 

It comes as no surprise that different / give different processes, sometimes 
dramatically so. To make things more concrete let us discuss a few examples. 

(1) Once-reinforced random walk is the process defined by 



N(e,t) = #{s such that 1 < s < t, (x s -i,x s ) = e] 



P(x t+ i = x\x ,...,x t ) 



f(N((x,x t ),t)) 



E f(N((y,x t ),t) 
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In other words, edges which have already been traversed are given double 
weight when compared to unknown edges, but the number of traversals does 
not matter. 

(2) Linearly reinforced random walk is the process defined by f(n) — 1 + n. In 
other words, whenever you traverse an edge, you increase its weight by 1. 

(3) Strongly reinforced random walk, where / grows faster than linear. For 
example, one may take f{n) = 1 + n 2 . 

Perhaps surprisingly, the order of difficulty seems to be decreasing. Once-reinforced 
random walk is the least understood, while the supcrlinear case is the simplest. 

Let us start with the supcrlinear case. It turns out that in this case the walk 
gets stuck on a single edge, going back and forth endlessly. To understand why, 
let us prove the following weaker claim: 

Theorem 1. Let G be an arbitrary graph, xq a vertex of G and e an edge going out 
of x. Examine reinforced random walk on G starting from xq with reinforcement 
function 1 + n 2 . Then with positive probability the process goes back and forth over 
e endlessly i.e. for all t, X2t = xo an d X2t+i is the other vertex of e. 

Proof. Denote the other vertex of e by y. Let E s be the event that x^t = %o and 
%2t+i = y for all 2t < s and 2t + l < s, respectively. Let us calculate the probability 
of E s+ i given E s . The numbers of traversals are N(e, s) = s and N(f, s) = for 
any edge / ^ e. Hence the probability is 



f(s) + (degx s - l)/(0) 1 + s 2 + (degx s - 1) " s 2 ' 

for some constant C that depends only on the degrees of xq and y. Therefore the 
probability of E s is 

P(E S ) = f[ P(E U \E U ^) > f[ (l - O (\)) > c 



for some positive c. This proves the claim. □ 

There is no zero-one law for the behaviour of processes like reinforced random 
walk, so one cannot easily conclude from the theorem above that some event of 
interest happens with probability 1. Nevertheless, this has been proved with almost 
complete generality. 

Theorem 2. Let G be a graph and f a function satisfying 

oo ^ 

fin) 

n=0 J K ' 



Then reinforced random walk on G with reinforcement function f eventually gets 
stuck on a single edge with probability 1, under either of the following conditions 
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(1) f is increasing, or 

(2) G contains no cycle of odd order. 

See Limic [15] and Limic and Tarres [16, 17]. It is conjectured that neither 
condition is necessary, but clearly, the result is quite complete as is. 

2. Phase transitions 

Let us now consider once-reinforced random walk, but let us add a parameter o 
and examine the reinforcement function 



An exciting conjecture due to Beffara & Sidoravicius is that, say for the graph Z rf , 
d > 3, there is a phase transition in a. For a very small the walk is essentially 
simple random walk. For a very large the walk looks like a man trapped in a 
balloon: it constantly jumps against the boundaries of the balloon, occasionally 
inflating it, but the balloon keeps its round shape, as it inflates. The "balloon" , of 
course, is the set of visited sites. In particular for d > 3 there should be a phase 
transition between recurrence and transience. Let us stress that a does not depend 
on t. It is not difficult to establish results if one first fixes t and then makes a very 
small or very large. In this case for a <C j one can directly check that the process 
is identical to simple random walk (with high probability), while for a 3> t the 
process gets stuck on its first edge. But the conjecture is that for a fixed a there is 
a marked difference in behaviour as t — > oo. Some simulation results can be seen 
in figure 1. Each simulation was done on a 5000 x 5000 grid and stopped when 
the process reached the boundaries. The 2-dimcnsional case was chosen since it is 
easiest to depict, but one should probably caution that in fact it is not clear what is 
the true behaviour in 2 dimensions, even on a conjectural level. Colour depicts the 
number of reinforced edge. This means that the second edge which is reinforced 
is given colour 2, not the second edge which is traversed (colour change rate is 
not identical in the 6 images). The fractal- like colour patterns seen in 1 + a = 4 
are indicative of the process leaving holes all around and then going back to fill 
them at a macroscopically later time. Presumably 4 is quite close to the point of 
criticality where the processes of roughening and smoothing balance exactly. 

The case of large a in this conjecture is related to a scries of similar models. In- 
ternal diffusion-limited aggregation is a process where the walker, upon reaching a 
new vertex, is transported to the starting point Xq. For this model it is known since 
the early 90s that the set of visited vertices is spherical [14, 13], with impressive 
improvements in the precision of the results achieved lately [11, 12, 2]. A variation 
where, when the walker, upon reaching a new vertex is transported to a random 
vertex in the existing cluster also gives a ball [3] . Getting closer to once-reinforced 
random walk we may examine excited to the center. In this model the walker, 





l + a = 6 1 + a = 10 

Figure 1. Once-reinforced random walk 



Reinforced random walk 



5 



upon getting to a new vertex gets a small drift towards the center. Less is known 
about this problem, basically only that the process is recurrent in any dimension, 
regardless of the strength of the drift [unpublished, 2006] . Once-reinforced has all 
the difficulties of excited to the center and more: there is no preferred direction 
and there is a phase transition. Hence it seems quite challenging at this point. 
Existing rigorous results on the model are on a tree [7], and it is not difficult to 
solve the one-dimensional case, but neither case exhibits a phase transition, so for 
now this phenomenon is beyond reach. 

Let us close this comparison by returning to the simulation results above. As 
can be seen, the shapes for 1 + a = 5 or 6 are not that round. This is unlike internal 
diffusion-limited aggregation where even quite small simulations are very close to 
a ball. A similar phenomenon can be seen in simulations of excited to the center 
done by Yadin. This is simply indicative that the boundary smoothing mechanism 
is far less efficient in these models. 



3. Polya's urn 

We now move to the main topic of this talk, linearly reinforced random walk, 
LRRW for short. Naively one might conjecture that, since ^2 b diverges so slowly, 
maybe the behaviour of LRRW is not that different from the behaviour of strongly 
reinforced random walk. Let us start with a simple example that shows that this 
is not at all the case. The example will be a graph with 3 vertices connected in 
a line, and in being the middle vertex. The reinforcement function 

• • • 

will be 2 + n. With this weight, the process is equivalent to the 
classic Polya urn. Recall that in the Polya urn model one repeatedly takes out 
a ball, and then returns two balls of the same colour as the ball you took out. 
Looking at the weights of the LRRW at time 2t (i.e. the values of f(N(e,2t))), 
and dividing them by 2, one gets a Polya urn. Let us recall Polya's theorem [23] 

Theorem 3. At time t (i.e. when the urn contains t+l balls), the probability that 
it contains k black balls is exactly j, for any value of k between 1 and t. 

Proof. Assume inductively that the claim has been proved for t — 1. Now, to get 
k black balls at time t you need one of the following: 

(1) Have k — 1 black balls at time t — 1 and take a black ball out of the bin; or 

(2) Have k black balls at time t — 1 and take a white ball out of the bin. 
The probability of (1) is 

1 fc-1 
t - 1 ' t 

where the term l/(t — 1) is our inductive assumption, and the term (fc — l)/t is 
the probability to pick a black ball when you have k — 1 black balls out of a total 
of t balls. This formula also holds for k = 1, though for a different reason: here 
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the probability is zero since black balls are not allowed, and the formula indeed 
gives 0. A similar calculation shows that the probability of (2) is 

1 t-k 
t - 1 ' t 

Summing (1) and (2) gives 

1 (k-\ t- k\ _ 1 
t - 1 ' \ ~t ' j~) ~ t 

and the induction is complete. □ 

Thus we see that the picture is very different from that of strongly reinforced 
random walk. Looking at strongly reinforced random walk at some very large time 
would give that with probability very close to 1, the walk visited one of the edges 
a very small number of times, and the other edge all the rest (from symmetry 
each one gets probability close to |). Here this is not the case, typically each edge 
is visited a number of times proportional to t. On the other hand, this is also 
very different from what we expect for, say, once-reinforced random walk. It is 
not difficult to analyze this explicitly and see that once-reinforced would have the 
number of visits to each edge at time t being |t, with the random errors being of 
order s/i. So we see that the linear case is quite special. 

The analogy to Polya's urn works in a number of other cases. Examine the 
graph Z. Examine a single vertex. Since there are no cycles, you know that when 
you leave your vertex through some edge e, you are bound to return through e, if 
you return at all. The other vertices can only effect you through preventing return, 
not in any other way. Thus you can couple LRRW on Z to a sequence of i.i.d. Polya 
urns. The same holds for any tree, except that you get multicoloured Polya urns. 
But these are also well understood. Using these ideas, Pcmantle [22] was able to 
analyze LRRW on regular trees. He showed that there is a phase transition in the 
initial weight a 

Theorem 4. Let G be a regular tree of degree > 3, and let a + n be the reinforce- 
ment function. Then there exists an a c such that for a < a c LRRW is (strongly) 
recurrent, while for a > a c it is transient. 

Here strong recurrence means that the expected time to return to your starting 
point is finite. This is significantly stronger than usual recurrence: infinite graphs 
(like Z or Z 2 ) are recurrent but not strongly so (for Z this is the well-known 
"gambler's ruin" theorem). In a way it means that the process behaves like random 
walk on a finite graph. Again, compare to the case of once-reinforced random walk 
on a tree which is transient for any value of the parameter [7] . 

Theorem 4 shows that importance of the parameter a, and from now on we will 
simply say "LRRW with a" rather than the cumbersome "LRRW with reinforce- 
ment function a + n" . Note that the parameter a has the opposite effect compared 
to once-reinforced random walk: there small a gave random-walk-like behaviour, 
while here it is the large a that give this. 
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Figure 2. Partial exchangeability 



Another case where the theory of urns is applicable is directed reinforced ran- 
dom walk. One modifies the definition of N to count directed traversals i.e. 

N dil {x, y, t) = #{f < s < t : x a - X = x, x s = y} 

and then again define x s by taking a + N d " and normalizing it. Here it is not 
important what graph one looks at, each vertex can be isolated and treated as a 
multicoloured urn independent of all others. Nevertheless, this urn argument only 
reduces the model to a kind of random walk in random environment, a notorious 
problem in its own right. This was tackled on the graphs Z d by Enriquez and 
Sabot [9, 24]. Interestingly, there is no phase transition when d > 3, the walk is 
transient for any choice of a. 

We will return to Polya's urn later, but for now let us go back to general graphs. 



4. Partial exchangeability 

Another way to understand why LRRW is special is that it has a property called 
partial exchangeability. This means that for a given path, its probability depends 
on how many times each edge is visited, but not on the order in which these visits 
happen. Figure 2 shows two such paths in a triangle. Both paths have length 4, 
and both cross one edge twice and the other two edges once, but are different. The 
numbers in the figure are the probability of each traversal for LRRW with a = 1. 
The reader can verify that the numbers are not the same, but their product is 

1112 12 11 
2'3'3'4~2'3'I'3 

Indeed the numerators on the left are a permutation of the numerators on the 
right; and ditto for the denominators. The general proof is the same: 

Theorem 5. LRRW is partially exchangeable. 

Proof. We may assume without loss of generality that the reinforcement function 
is n + a for some a > 0. Let 7 be a path in the graph G. Then the probability 
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p(j) that 7 = (70, . . . , 7;) is the beginning of the LRRW path is 



i-i 



q + iV 7 ((7i,7i + i),i) 
X; a + 7V 7 ((7 l; y),i) 



P(7) = I 



where A 7 is of course the analog of N above for the path 7 i.e. iV 7 (e, i) = #{1 < 
s < t : (7s_i,7«) = e}. Now, if N~(e,l) = q for some g, then the numerators 
contain the terms a, a + 1, ...,a + q at the q places where e is an edge of 7. 
This is not specific to linear reinforcement. Now assume that the vertex v 7^ xq 
appears r times in the path 70, ... , 7;_i (note that we do not count 7;). Then the 
denominators contain a deg v + 1, a deg v + 3, . . . , a deg v + 2r — 1 at the r places 
where 7$ = v. For v = xo you get a similar picture, except the denominators 
contain a deg xq, . . . , a deg xq + 2i — 2. In other words it does not matter at which 
order did 7 traverse the edges coming out of v, since each such traversal increases 
the total weight of the edges surrounding v by 1 (and another 1 is added when 
returning to the vertex). This is the point which is special to linear reinforcement. 

Hence, if you know iV 7 (e, I) for all e you can reconstruct both numerators and 
denominators up to a permutation (this requires the observation that you can 
identify the last vertex of 7, since it is the only one with odd total number of 
appearances, or xq if no vertex has this property). Thus you can calculate ^(7) 
from 7V 7 , and the process is partially exchangeable. □ 

Partial exchangeability is interesting because of the following theorem of Dia- 
conis & Freedman [4] 

Theorem 6. Let x = {xo, Xi, . . . } be a random sequence of elements in some state 
space, which is recurrent. Then x is partially exchangeable if and only if it is a 
mixture of Markov chains. 

Here recurrent means that the sequence returns to its starting vertex with 
probability 1. This has to be specified as for general random sequences the various 
definitions of recurrence are not identical. In fact, the counterexamples in [4] which 
show that recurrence is necessary consist of two states and a process that gets stuck 
in the second state. A mixture of Markov chains means that there is some measure 
/i on the space of Markov chains such that the probability of every event E is given 



where W is a Markov chain and V W (E) is the probability of E under the Markov 
chain W. The same concept is known under the name of random walk in random 
environment. Again, you first pick an environment W randomly, then perform 
random walk on it. We already mentioned random walk in random environment 
when we discussed the directed case above. But there each vertex was independent, 
while in the general case this is typically not the case. 

We will not prove the theorem, but just make some remarks on its proof. 
Recurrence is used to break the infinite process into blocks, from one visit of xq to 
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the next. These blocks then turn out to be fully exchangeable, i.e. any permutation 
has the same probability. This allows to apply de Finetti's theorem which states 
that such a process is a mixture of i.i.d. processes. De Finetti's theorem itself is 
not difficult either, it is an application of Krein-Milman. The point to note here 
is that the proof is soft and abstract, without any explicit calculations. 

Let us demonstrate the utility of this fact by applying it to Polya's urn, i.e. to 
LRRW on a line of length 3 with a = 2. There is no question as to recurrence, 
since our walker returns to its starting point every 2 steps without failure. Our 
random environment can only relate to the central vertex, so asking "what is the 
random environment?" is equivalent to asking "what is the probability to go left 
from the central vertex?" . In other words, there is only one parameter. Let us 
denote it by p. 

Theorem 7. The parameter p defined above is uniform in [0, 1]. 

Proof. Fix p and examine the Markov chain on a line of length 3 which moves from 
the central vertex to the left vertex with probability p and to the right vertex with 
probability 1 — p. Denote by L t the number of times the left edge was crossed by 
time t. By the law of large numbers we know that 

lim — - = p almost surely. 

t— >oo t 

The random walk in random environment tells us that for the measure p, that we 
seek, 

v{heE)=j\ P (±,E)d» { p). 

This holds for every measurable E, but let us restrict our attention to intervals. 
By the explicit solution of Polya's urn (theorem 3 above), the left hand side is 
l-El + 0(l/t) where \E\ is the Lebesgue measure of E. Taking t —> oo (which 
can be done inside the integral as fi is a probability measure and the integrand is 
bounded) gives 

\E\= f i{peE}dfi( P ). 
Jo 

Since this holds for all intervals E, we get that fi is uniform, as needed. □ 

In other words, the theorem of Diaconis and Frccdman allows us to get from the 
"static" description of the Polya urn we had before, a "dynamic" description which 
is fuller. We now know that Polya's urn is equivalent, as a process, to choosing 
some number p £ [0, 1] uniformly and then picking each time a black ball with 
probability p and a white ball with probability 1 — p. 

In general graphs it proved to be a significant challenge to find an explicit 
formula for the environment, and this is what we turn to next. 
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5. The magic formula 

Let us consider for a moment finite graphs. Even a triangle would prove to be 
highly nontrivial! LRRW is recurrent on any finite graph — this is not difficult to 
show, but we will skip it here. The theorem of Diaconis & Freedman promises us 
that some random environment exists such that LRRW is equivalent to random 
walk in the environment. But what is it? It turns out there is a formula for it, 
known fondly as the "magic formula" . The history of its discovery is complicated 
and the reader would be best served by reading about it in detail in [18]. It 
turns out that the environment is always reversible (a fact which does not follow 
from Diaconis & Freedman) . This means that there is a (random) weight function 
W : E — > [0, oo ) where E is the set of edges of G such that the probability to go 
from x to y is 

W(x,y) 

£ w(x,zy 

It is common to denote the denominator by W(x) and wc shall do so below. 
Reversibility is perhaps the most important property a Markov chain may or may 
not possess. Most notably a reversible Markov chain gives rise to a self-adjoint 
operator (on I 2 with the weight being W) which is amenably to spectral analysis. 

Now, W is not unique since multiplying W by a constant gives rise to ex- 
actly the same Markov chain. So we need to normalize and we normalize so that 
J2 x ~yW(x,y) = 1. For completeness, let us state the magic formula for non- 
constant initial weights, i.e. assume for every edge e there is some a e and the 
transition probabilities at time t are a e + N(e,t), normalized. The magic formula 
is then: 

Theorem 8. Let G = (V, E) be a finite graph. Examine LRRW on G, starting 
from Xo with initial weights a e , and let \i be the corresponding random environment. 
Then the density of /i at a given W is given by 

1 eG-E 

Z W{x )^ W{v)^ +1) 

v£V\{x } 

where the sum inside the square root is a sum over all spanning trees of G; where 
Z is a normalization constant; and where a v = ^2 e3v a e . 

The most problematic term is the sum over trees, as it introduces long-range de- 
pendencies. By the matrix-tree theorem this sum can be written as a determinant, 
but that does not simplify the formula significantly. 

As an exercise, let us analyse Polya's urn again: 

Second proof of theorem 7. We use the magic formula. All a e = 2, a Xa = 4 and 
for v ^ xq (i.e. one of the side vertices), a v = 2. Since the normalization is 



/£ n w ^ 
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Reinforced random walk 



11 



W(e) = 1, we have W(xq) = 1. Denote the weight of the left edge by p, so for 
one of the side vertices W(v) = p while for the other it is I— p. All in all this gives 



Z ■ l2.p3/2 (1 _ p )3/2VP^ P) 

The expression inside the square root is so simple because a line graph has exactly 
one spanning tree (the graph itself) , so the sum is only over one term. Everything 
cancels and we get that the density is -g-, which of course means that Z = 1, 
proving the theorem. □ 

Throughout the last decade the magic formula stood in the center of research 
on LRRW. Even the case of the ladder was not so easy, and was done in [19]. The 
next result achieved was a tightness result that allowed to show that LRRW is 
always equivalent to a random walk in random environment, even in the transient 
case (which is not covered by Diaconis & Frccdman) [20]. The most impressive 
result, also due to Mcrkl & Rollcs [21], was the proof that a certain two-dimensional 
graph is recurrent. A conjecture that goes back to the 80s [22, page 1241] is that 
LRRW on Z 2 is recurrent, at least for a sufficiently small. Mcrkl & Rolles got 
quite close, but their proofs required to replace each edge by a line of length 130 
or more. 

Around the same period a surprising connection to certain quantum models 
appeared, a topic we will discuss next. 



6. The hyperbolic sigma model 

The hyperbolic sigma model has its roots in physics. As some of the readers (and, 
frankly, the author) may not be familiar with the necessary background, we will 
be short and vague. Hopefully the description will still be useful to some. Our 
starting point is supersymmetry. 

Supcrsymmctry started its life as a branch of particle physics that postulated 
that each of the known particles has a partner (a "superpartner" ) which is similar 
in most aspects, but the superpartner of a fcrmion is a boson and vice versa. 
The existence of these particles leads to cancellations which would solve several 
theoretical problems with the current model. These superpartners were never 
found in particle accelerators, but the mathematics of these cancellations turned 
out to be very fruitful. 

One potential application of supersymmetry to problems which have nothing 
to do with bosons or fermions is the Wegner-Efetov approach to random matrices 
and to certain disordered quantum systems [8] . The titular hyperbolic sigma model 
captures some of the mathematical difficulties of this approach, and, conjecturally, 
most of the phenomenology. It has been studied by Disertori, Spencer & Zirnbauer 
[26, 5, 6]. The model, roughly, is as follows. We are looking at functions a from 
the vertices of our graph (say Z d ) into hyperbolic space. The action (or energy) 
A(a) is a sum over the edges of some interaction which respects the symmetries 
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of the hyperbolic space. Then each a is given probability density proportional to 
e~P A where (3 is a parameter which is analogous to the initial weights a in LRRW. 

The approach taken in [26] to tackle the hyperbolic sigma model is to make a 
change of variable in the hyperbolic plane (change to the so-called horospherical 
coordinates) and then integrate some of the variables. The result was a sigma 
model with values in R but with a much more complicated interaction term, which 
included a determinant very similar to the one appearing in the magic formula. 
This caused significant curiosity but for a few years it was not clear whether the 
similarity is coincidental or not. This mystery was solved in November 2011. 

7. The solution 

It was Sabot and Tarres who managed to make the connection between LRRW 
and the hyperbolic sigma model rigorous [25]. They investigated a variation on 
LRRW called the vertex-reinforced jump process, defined in continuous time. On 
the one hand the vertex-reinforced jump process has a random walk in random 
environment representation with the density of each environment identical to the 
formulas that appear in the hyperbolic sigma model. This allowed them to harness 
the results of [5] to show that the vertex-reinforced jump process is recurrent for 
small a in any graph (the jump process has a parameter which acts analogously to 
a for LRRW). On the other hand, they showed that LRRW is given by a vertex- 
reinforced jump process with random weights. Thus, after some modification of 
the techniques of [5] to work with different, unbounded (3 in different edges, they 
were able to show: 

Theorem 9. For every graph G there exists some ao such that LRRW is recurrent 
for all a < oq. 

Simultaneously, with Angel and Crawford, we arrived at the same result [1]. 
Our approach was completely different and did not use the magic formula at all. 
The basic idea is very simple. 

Basic idea of the proof. Examine some edge e = (x,y) and assume you have tra- 
versed e for the first time from x to y. Let / be the edge through which you arrived 
into x. If a is extremely small, then upon arriving to x you see one edge, /, with 
weight 1 + a, and all the others with weight a. Hence you are very likely to return 
via /. This will repeat on the second, third etc. visit to x. At each time, because 
/ has much larger weight than e (its weight is even increasing, but we will not use 
that) , it will choose to exit x through / multiple times until finally exiting through 
e. Denote the number of times it exited through / before the first exit through e 
by N. 

We now recall that the process is also a random walk in random environment. 
The environment might be complicated, but we have earned a bit of information 
about it. We know that the walker exited the vertex x N times through / before the 
first time it exited through e. This must mean that the weight of / is approximately 
N times larger than that of e! In other words, we see that the weight of edges 
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decays exponentially in the "first entrance distance" , i.e. if we define fi as the edge 
through which we entered the other vertex of /, /3 as the edge through which we 
entered the other vertex of ji etc., we finally reach /„ = Xq. Then the argument 
says that the weight of e should decay exponentially in n. □ 

Comparing to the approach of Disertori-Spencer-Sabot-Tarres, our approach 
is much softer, and has almost no explicit calculations. We rely on the existence 
of a random walk in random environment representation, i.e. on the theorem of 
Diaconis & Freedman, but that too, as explained above, is a soft result. 

Trying to formalize this leads to some minor technical difficulties. The first 
is that, to get exponential decay, different edge ratios need to be independent. 
However, different edges are not independent, because the "first entrance path" is 
a global property which introduces dependencies between all quantities. To tackle 
this difficulty we use a somewhat brutish approach and simply count over all such 
paths, i.e. write 

E (W e ) = E (W e ■ 1{7 is the first entrance path}) . 

7 

This allows to just fix two edges e and / sharing a vertex x, and define a variable 
as in the basic idea. These variables (call them R e j) are independent and have 
expectation < C^fa. Thus we argue, very roughly, as follows 

E (W e ■ 1{7 is the first entrance path}) = 

(len 7— 1 \ 
-^(7i-i,7i),(7i,7i+i) ' -H7 i s the first entrance path} ) < 

(Icn 7 — 1 \ 
II R i^-md, (7*,7*+l) J = n Ei? (^-i,7.),(7*,7 I+ l) - (CV") 0117 • (!) 

The first equality is because under the event that 7 really is the first entrance 
path, W e is really the product of the R e j along 7. The inequality that follows 
comes from just throwing this event away. This seems very wasteful, but look at 
the final result: we show that the expectation is exponentially small, and not only 
that, we can make the exponent base as small as we want by reducing a. So losing 
a fixed exponential term by counting over the 7 is something that our approach 
can sustain. The third step in (1) is simply independence, and the fourth is the 
estimate that came from the "basic idea" sketch above. 

Another technical difficulty that might be worth mentioning is that, even 
though the weights typically decrease along the first entrance path, sometimes 
they increase. Once you incorporate this phenomenon into the basic sketch, you 
sec that the R e j, despite being typically Cy/a, in fact have infinite expectation. 
We solve this by taking everything to some small fixed power (j worked). Thus 
(1) as written is false, and what we actually prove is 

E(V e 1/4 ) < {C^) dist ^ Xo) . 
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Returning to our favourite example, Polya's urn, we see that the phenomenon is 
real: the ratio between the weights of the two edges has no first moment. So some 
power must appear in the formulation of the result, though i is not optimal. With 
these two problems solved, we were able to show the exponential decay of weights. 
Let us state this formally: 

Theorem 10. Let G be a graph with all degrees bounded by K . Then there exists 
ao = ao(K) > such that for all a € (0, ao), 

(\ / \ dist(e,i>o) 

8. Where now? 

The next step to understand the phase transition is to understand the transient 
regime. Using our techniques we were able to show the existence of a transient 
regime on any non-amenable graph. But getting to Z d seems to require a new idea. 

Next one would like to show that the phase transition is unique, namely that 
there is some a c such where the process moves from recurrence to transience, and 
not, say, that the process switches from recurrence to transience and back a few 
times before finally becoming transient for high enough a. In many models proving 
this kind of fact requires monotonicity, but there are also other approaches, so e.g. 
[10]. 

There are also many other models we would like to know similar results. We 
already covered other reinforced walks, but the connection to the hyperbolic sigma 
models suggests many more questions. Arc there other sigma models where one can 
get an interacting representation for random walk on the generated field? All-in-all 
this is an exciting and fast-moving field, with a lot more to understand. 
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